Multi - label classification with Bayesian network - based chain classifiers q
نویسندگان
چکیده
In multi-label classification the goal is to assign an instance to a set of different classes. This task is normally addressed either by defining a compound class variable with all the possible combinations of labels (label power-set methods) or by building independent classifiers for each class (binary relevance methods). The first approach suffers from high computationally complexity, while the second approach ignores possible dependencies among classes. Chain classifiers have been recently proposed to address these problems, where each classifier in the chain learns and predicts the label of one class given the attributes and all the predictions of the previous classifiers in the chain. In this paper we introduce a method for chaining Bayesian classifiers that combines the strengths of classifier chains and Bayesian networks for multi-label classification. A Bayesian network is induced from data to: (i) represent the probabilistic dependency relationships between classes, (ii) constrain the number of class variables used in the chain classifier by considering conditional independence conditions, and (iii) reduce the number of possible chain orders. The effects in the Bayesian chain classifier performance of considering different chain orders, training strategies, number of class variables added in the base classifiers, and different base classifiers, are experimentally assessed. In particular, it is shown that a random chain order considering the constraints imposed by a Bayesian network with a simple tree-based structure can have very competitive results in terms of predictive performance and time complexity against related state-ofthe-art approaches. ! 2013 Elsevier B.V. All rights reserved.
منابع مشابه
Expressive Power of Binary Relevance and Chain Classifiers Based on Bayesian Networks for Multi-label Classification
A b s t r a c t . Bayesian network classifiers are widely used in machine learning because they intuitively represent causal relations. Multi-label classification problems require each instance to be assigned a subset of a defined set of h labels. This problem is equivalent to finding a multivalued decision function that predicts a vector of h binary classes. In this paper we obtain the decisio...
متن کاملMulti-label classification with Bayesian network-based chain classifiers
Article history: Available online 20 November 2013
متن کاملDecision functions for chain classifiers based on Bayesian networks for multi-label classification
Article history: Received 16 December 2014 Received in revised form 17 April 2015 Accepted 11 June 2015 Available online 23 June 2015
متن کاملTractability of most probable explanations in multidimensional Bayesian network classifiers
Multidimensional Bayesian network classifiers have gained popularity over the last few years due to their expressive power and their intuitive graphical representation. A drawback of this approach is that their use to perform multidimensional classification, a generalization of multi-label classification, can be very computationally demanding when there are a large number of class variables. Th...
متن کاملA Mixtures-of-Experts Framework for Multi-Label Classification
We develop a novel probabilistic approach for multi-label classification that is based on the mixtures-of-experts architecture combined with recently introduced conditional tree-structured Bayesian networks. Our approach captures different input-output relations from multi-label data using the efficient tree-structured classifiers, while the mixtures-of-experts architecture aims to compensate f...
متن کامل